Search CORE

3 research outputs found

Functional modeling of high-dimensional data: a Manifold Learning approach

Author: Aguilera-Morillo M. Carmen
Hernández-Roig Harold A.
Lillo Rodríguez Rosa Elvira
Publication venue: MDPI
Publication date: 02/02/2021
Field of study

This article belongs to the Special Issue Methodological and Applied Contributions on Stochastic Modelling and ForecastingThis paper introduces stringing via Manifold Learning (ML-stringing), an alternative to the original stringing based on Unidimensional Scaling (UDS). Our proposal is framed within a wider class of methods that map high-dimensional observations to the infinite space of functions,allowing the use of Functional Data Analysis (FDA). Stringing handles general high-dimensional data as scrambled realizations of an unknown stochastic process. Therefore, the essential feature of the method is a rearrangement of the observed values. Motivated by the linear nature of UDS and the increasing number of applications to biosciences (e.g., functional modeling of gene expression arrays and single nucleotide polymorphisms, or the classification of neuroimages) we aim to recover more complex relations between predictors through ML. In simulation studies, it is shown that MLstringing achieves higher-quality orderings and that, in general, this leads to improvements in the functional representation and modeling of the data. The versatility of our method is also illustrated with an application to a colon cancer study that deals with high-dimensional gene expression arrays.This paper shows that ML-stringing is a feasible alternative to the UDS-based version. Also, it opens a window to new contributions to the field of FDA and the study of high-dimensional data.This research was funded in part by Ministerio de Ciencia, Innovación y Universidades grant numbers PID2019-104901RB-I00 and MTM2017-88708-P

Universidad Carlos III de Madrid e-Archivo

Functional Modeling of High-Dimensional Data: A Manifold Learning Approach

Author: Harold A. Hernández-Roig
M. Carmen Aguilera-Morillo
Rosa E. Lillo
Publication venue: 'MDPI AG'
Publication date: 19/02/2021
Field of study

This paper introduces stringing via Manifold Learning (ML-stringing), an alternative to the original stringing based on Unidimensional Scaling (UDS). Our proposal is framed within a wider class of methods that map high-dimensional observations to the infinite space of functions, allowing the use of Functional Data Analysis (FDA). Stringing handles general high-dimensional data as scrambled realizations of an unknown stochastic process. Therefore, the essential feature of the method is a rearrangement of the observed values. Motivated by the linear nature of UDS and the increasing number of applications to biosciences (e.g., functional modeling of gene expression arrays and single nucleotide polymorphisms, or the classification of neuroimages) we aim to recover more complex relations between predictors through ML. In simulation studies, it is shown that ML-stringing achieves higher-quality orderings and that, in general, this leads to improvements in the functional representation and modeling of the data. The versatility of our method is also illustrated with an application to a colon cancer study that deals with high-dimensional gene expression arrays. This paper shows that ML-stringing is a feasible alternative to the UDS-based version. Also, it opens a window to new contributions to the field of FDA and the study of high-dimensional data

Multidisciplinary Digital Publishing Institute

Estimating the COVID-19 Prevalence in Spain With Indirect Reporting via Open Surveys

Author: Baquero Carlos
Fernandez Anta Antonio
Frey Davide
Garcia-Agundez Augusto
Georgiou Chryssis
Goessens Mathieu
Hernández-Roig Harold A.
Lillo Rosa E.
Menezes Raquel
Nicolaou Nicolas
Ojo Oluwasegun
Ortega Antonio
Stavrakis Efstathios
Publication venue: Frontiers
Publication date: 01/01/2021
Field of study

During the initial phases of the COVID-19 pandemic, accurate tracking has proven unfeasible. Initial estimation methods pointed toward case numbers that were much higher than officially reported. In the CoronaSurveys project, we have been addressing this issue using open online surveys with indirect reporting. We compare our estimates with the results of a serology study for Spain, obtaining high correlations (R squared 0.89). In our view, these results strongly support the idea of using open surveys with indirect reporting as a method to broadly sense the progress of a pandemic

HAL-CentraleSupelec

TUbiblio

Universidade do Minho: RepositoriUM

INRIA a CCSD electronic archive server

HAL Descartes

Universidad Carlos III de Madrid e-Archivo

tuprints

Hal-Diderot

HAL-Rennes 1